# OCR Optimization
Mlcd Vit Large Patch14 336
Apache-2.0
A visual feature extraction model based on ViT-L/14@336px architecture, surpassing CLIP benchmarks in multiple multimodal tasks
Multimodal Fusion
Safetensors
M
DeepGlint-AI
1,450
10
Detr Resnet 50 Finetuned OCR
Apache-2.0
An OCR model fine-tuned from facebook/detr-resnet-50 for object detection tasks
Text Recognition
Transformers

D
ismadoukkali
15
1
Featured Recommended AI Models